 standard model


0fe6a94848e5c68a54010b61b3e94b0e-Supplemental.pdf

Neural Information Processing Systems

Post-hoc gradient-based interpretability methods [1, 2] that provide instance-specific explanations of model predictions are often based on assumption (A): the magnitude of input gradients (gradients of logits with respect to the input) noisily highlights discriminative task-relevant features. In this work, we test the validity of assumption (A) using a three-pronged approach: 1. We develop an evaluation framework, DiffROAR, to test assumption (A) on four image classification benchmarks. Our results suggest that (i) input gradients of standard models (i.e., trained on original data) may grossly violate (A), whereas (ii) input gradients of adversarially robust models satisfy (A) reasonably well.
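
As a rough illustration of what assumption (A) refers to, the sketch below computes the gradient of a logit with respect to the input and ranks input features by gradient magnitude. The tiny one-hidden-layer tanh network, its weights, and the names `logit`, `input_gradient`, and `rank_by_magnitude` are hypothetical and chosen only for this example; this is not the DiffROAR framework itself.

```python
import math

def logit(x, W1, w2):
    """Scalar logit of a tiny one-hidden-layer tanh network (hypothetical model)."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    return sum(v * h for v, h in zip(w2, hidden))

def input_gradient(x, W1, w2):
    """Analytic gradient of the logit with respect to the input x."""
    grad = [0.0] * len(x)
    for row, v in zip(W1, w2):
        pre = sum(w * xi for w, xi in zip(row, x))
        scale = v * (1.0 - math.tanh(pre) ** 2)  # chain rule through tanh
        for j, w in enumerate(row):
            grad[j] += scale * w
    return grad

def rank_by_magnitude(grad):
    """Feature indices sorted from largest to smallest |gradient|."""
    return sorted(range(len(grad)), key=lambda j: abs(grad[j]), reverse=True)

# Illustrative weights: feature 1 is ignored by the model (all-zero column).
W1 = [[2.0, 0.0, 0.1], [-1.5, 0.0, 0.2]]
w2 = [1.0, -1.0]
x = [0.1, 0.5, -0.3]

grad = input_gradient(x, W1, w2)
ranking = rank_by_magnitude(grad)
print(ranking)  # feature 1 has zero gradient, so it is ranked last
```

Under assumption (A), the top-ranked features would be the ones that matter most for the task; the abstract's point is that this may fail for standard models.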


Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Neural Information Processing Systems

We present a novel probabilistic programming framework that couples directly to existing large-scale simulators through a cross-platform probabilistic execution protocol, which allows general-purpose inference engines to record and control random number draws within simulators in a language-agnostic way. The execution of existing simulators as probabilistic programs enables highly interpretable posterior inference in the structured model defined by the simulator code base. We demonstrate the technique in particle physics, on a scientifically accurate simulation of the tau lepton decay, which is a key ingredient in establishing the properties of the Higgs boson. Inference efficiency is achieved via inference compilation where a deep recurrent neural network is trained to parameterize proposal distributions and control the stochastic simulator in a sequential importance sampling scheme, at a fraction of the computational cost of a Markov chain Monte Carlo baseline.
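
The importance sampling idea mentioned in the abstract can be sketched on a toy problem. The example below assumes a conjugate Gaussian model (prior N(0, 1), likelihood N(theta, 1)) in place of a physics simulator, and a fixed Gaussian proposal in place of the trained recurrent network; the function names and all parameter values are illustrative assumptions, not part of the described framework.

```python
import math
import random

def log_normal_pdf(x, mean, std):
    """Log density of a normal distribution."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std) - 0.5 * math.log(2.0 * math.pi)

def posterior_mean_is(y, num_samples, proposal_mean, proposal_std, rng):
    """Self-normalized importance sampling estimate of E[theta | y]."""
    samples, log_weights = [], []
    for _ in range(num_samples):
        theta = rng.gauss(proposal_mean, proposal_std)  # draw from the proposal
        log_joint = log_normal_pdf(theta, 0.0, 1.0) + log_normal_pdf(y, theta, 1.0)
        log_q = log_normal_pdf(theta, proposal_mean, proposal_std)
        samples.append(theta)
        log_weights.append(log_joint - log_q)  # log importance weight
    shift = max(log_weights)  # subtract the max for numerical stability
    weights = [math.exp(lw - shift) for lw in log_weights]
    total = sum(weights)
    return sum(w * s for w, s in zip(weights, samples)) / total

rng = random.Random(0)
estimate = posterior_mean_is(y=1.0, num_samples=20000,
                             proposal_mean=0.5, proposal_std=1.0, rng=rng)
print(estimate)  # the exact posterior mean for this toy model is 0.5
```

A proposal close to the true posterior keeps the weights well balanced, which is the motivation for learning the proposal rather than fixing it by hand as done here.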


Quasiprobabilistic Density Ratio Estimation with a Reverse Engineered Classification Loss Function

arXiv.org Machine Learning

We consider a generalization of the classifier-based density-ratio estimation task to a quasiprobabilistic setting where probability densities can be negative. The problem with most loss functions used for this task is that they implicitly define a relationship between the optimal classifier and the target quasiprobabilistic density ratio which is discontinuous or not surjective. We address these problems by introducing a convex loss function that is well-suited for both probabilistic and quasiprobabilistic density ratio estimation. To quantify performance, an extended version of the Sliced-Wasserstein distance is introduced which is compatible with quasiprobability distributions. We demonstrate our approach on a real-world example from particle physics, of di-Higgs production in association with jets via gluon-gluon fusion, and achieve state-of-the-art results.
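
For context, the standard (non-negative) classifier-based density-ratio trick can be sketched as follows: for balanced classes, a Bayes-optimal classifier outputs s(x) = p(x) / (p(x) + q(x)), so the ratio is recovered as p(x) / q(x) = s(x) / (1 - s(x)). The two Gaussian densities and the function names below are assumptions made for illustration; this shows the conventional probabilistic case, not the paper's convex loss or its quasiprobabilistic extension.

```python
import math

def gaussian_pdf(x, mean, std):
    """Density of a normal distribution."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def optimal_classifier(x, p, q):
    """Bayes-optimal output s(x) = p(x) / (p(x) + q(x)) for balanced classes."""
    return p(x) / (p(x) + q(x))

def ratio_from_classifier(s):
    """Recover the density ratio p/q from the classifier output s."""
    return s / (1.0 - s)

# Illustrative densities (assumed for this sketch).
p = lambda x: gaussian_pdf(x, 0.0, 1.0)
q = lambda x: gaussian_pdf(x, 1.0, 2.0)

x = 0.3
s = optimal_classifier(x, p, q)
ratio = ratio_from_classifier(s)
print(ratio, p(x) / q(x))  # the two values agree up to rounding
```

When densities may be negative, s(x) is no longer confined to the interval [0, 1], which gives some intuition for why the usual loss functions can run into the problems the abstract describes.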


Infamous 3I/ATLAS comet is covered in ice volcanoes, surprising astronomers

Popular Science

It's still not aliens, but the interstellar comet keeps getting weirder. As comet 3I/ATLAS continues its exciting journey through our solar system, scientists are still learning everything they can about this special space rock. It is only the third interstellar object ever tracked through our solar system and is among the fastest comets ever observed. As 3I/ATLAS nears its closest distance to Earth, an international team of astronomers now says the space rock may be covered in active, icy cryovolcanoes.


Discovering the Underlying Analytic Structure Within Standard Model Constants Using Artificial Intelligence

arXiv.org Artificial Intelligence

This paper presents a method for uncovering hidden analytic relationships among the fundamental parameters of the Standard Model (SM), a foundational theory in physics that describes the fundamental particles and their interactions, using symbolic regression and genetic programming. Using this approach, we identify the simplest analytic relationships connecting pairs of these constants and report several notable expressions obtained with relative precision better than 1%. These results may serve as valuable inputs for model builders and artificial intelligence methods aimed at uncovering hidden patterns among the SM constants, or may potentially be used as building blocks for a deeper underlying law that connects all parameters of the SM through a small set of fundamental constants.